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(w) Improved strains of yeast for the expression of heterologous genes. 

(&7) The GALA protein is rate-limiting in quantity as a positive 
regulator for galactose-inducible promoters in strains of 
yeast. Novel strains are described in which the GALA protein 
can be overproduced in a regulatable fashion. These strains 
are useful for the regulatable expression in yeast of hetero- 
logous genes whose expression is driven by a galactose- 
inducible promoter. 
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TITLE OF THE INVENTION 

IMPROVED STRAINS OF YEAST FOR THE EXPRESSION OF 
HETEROLOGOUS GENES 

5 BACKGROUND OF THE INVENTION 

Saccharomvces cerevisiae has proven 
versatile as a host species for the expression of 
foreign polypeptides. Many different proteins from a 
variety of species have been expressed in £_i_ 

10 cerevisiae . some to levels of >10% of total cell 

protein. Typically, expression has been mediated by 
a plasmid containing yeast regulatory sequences 
(transcriptional promoter and terminator) 
circumscribing the structural gene for the expressed 

15 polypeptide as well as other sequences required for 
the selection and amplification of plasmids in both 
£L_ cerevisiae and in Escherichia coli . In addition, 
it has been possible to integrate the yeast 
regulatory sequences circumscribing the structural 

20 gene for the expressed polypeptide into a yeast 
chromosome and achieve high-level expression. 
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S . cerevisiae has A genes which encode the 
enzymes responsible for the utilization of galactose 
as a carbon source. The fiAfcl, QhL.2, SAL7, and fiALlO 
genes respectively encode galactokinase, galactose 

5 permease, a-D-galactose-l-phosphate uridyltransf er- 
ase, and uridine diphosphogalactose-4-epimerase . In 
the absence of galactose, very little expression of 
these enzymes is detected. If cells initially are 
grown on medium containing glucose, and galactose is 

10 added to the culture, these four enzymes are induced 
coordinately by at least 1000-fold upon depletion of 
glucose from the media. This induction has been 
shown to occur at the level of RNA transcription. 
The GAL 1 , GAL 2 , GAL 7 and GAL 10 genes have been 

15 molecularly cloned and sequenced. The regulatory and 
promoter sequences to the 5' sides of some of the 
reS p ec tive coding regions have been placed adjacent 
to the coding regions of the lac Z gene. These 
experiments have defined those promoter and 

20 regulatory sequences which are necessary and 

sufficient for galactose induction and which are 
useful for driving the expression of heterologous 
genes . 

The GAL 4 and GAL 80 gene products are 
25 respectively positive and negative regulators of the 
expression of the GAL 1, GAL 2 * GAL 7 , and Qhl> 10 genes 
as well as of the MEL 1 gene, which encodes 
a-galactosidase, an enzyme required for melibiose 
catabolism. The GAL 4 product acts as a positive 
30 regulator at the transcriptional level by binding to 
specific DNA sequences 5* to the structural 
information of these genes. In the absence of 
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galactose, the GAL 80 gene product (protein) interacts 
with the GAL 4 gene product or with a t)NA sequence to 
prevent this transcriptional activation event. In 
the presence of galactose, the GAL 80 protein 
5 apparently cannot antagonize the GAL 4 protein, and 
the GAL 4 protein can function as a transcriptional 
activator. The GAL 4 gene is expressed at very low 
levels, and its product is rate-limiting for the 
optimal induction of the GAL gene promoters. This is 
10 especially true when a cell contains multiple copies 
of plasmids with galactose- 
inducible prompters driving the expression of 
heterologous genes. However, if it is a goal to 
maintain minimal or no expression of a heterologous 
15 gene before the addition of galactose, it is 

important that the GAL 4 gene be expressed at very low 
levels in the absence of galactose. 

In a variety of recombinant microbial 
expression systems, the synthesis of many different 
20 heterologous polypeptides has been shown to be 

deleterious to the host cell. As a consequence, 
there is selective pressure against the expression of 
such heterologous polypeptides, such that the only 
cells which accumulate in a scale-up of such a 
25 recombinant culture are those which do not express 

the heterologous polypeptide or express so little of 
the heterologous polypeptide that the culture becomes 
an uneconomical source of that polypeptide. An 
optimal scheme for the scale-up of such a recombinant 
30 culture would be to maintain minimal or no expression 
of the heterologous gene during the expansion of the 
culture to a large volume and high cell density and 
then to induce the maximal expression of the 
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heterologous gene only in the final stage of culture 
growth prior to product isolation. 



QR.TRrTS OF THE INVENTION 

5 It is an object of the present invention to 

provide novel strains of yeast in which the 
protein can be overproduced in a regulatable fashion, 

such that the expression of heterologous genes driven 
by galactose-inducible promoters can be minimized 
10 during the expansion of the culture to a high cell 
density and maximized at the final stage of culture 
growth prior to product isolation. This and other 
objects of the present invention will be apparent 
from the following description, 

15 

PT7MMARY OF THE INVENTION 

The GAL 4 protein is rate-limiting in 
quantity as a positive regulator for galactose- 
inducible promoters in strains of yeast- Novel 
20 strains are described in which the GAL 4 protein can 
be overproduced in a regulatable fashion. These 
strains are useful for the regulatable expression in 
yeast of heterologous genes whose expression is 
driven by a galactose-inducible promoter. 

25 

FRTEF DESCRIP TION OF THE FIGURES 

Figure 1 depicts the preparation of pKHint-A. 
Figure 2 depicts the preparation of pKHint-B 

and pKHint-C. 
30 Figure 3 depicts the preparation of 

Sc252(pG£L10/S£L4) and Sc294 (pG^10/£AL4) . 

Figure 4 depicts the preparation of pRJ178. 
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Figure 5 depicts the preparation of pJC197. 
Figure 6 deplcdis the preparation of pKH207-l. 
Figure 7 depS&ts the preparation of pKH4 . 
Figure 8 depicts the preparation of pKH4-EBMA . 

5 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed to the 
development of novel strains of yeast in which the 
SAL4 gene product can be overexpressed in a regulat- 

10 able fashion. The present invention also is directed 
to the use of such novel strains as a host for the 
expression of heterologous genes driven by a 
galactose-inducible promoter. Specifically, it is 
directed to conditions for the scale-up of such 

15 recombinant cultures such that heterologous gene 

expression can be minimized during the expansion of 
the culture to a large volume and high cell density 
and then maximized at the final stage of culture 
growth prior to product isolation, 

20 An expression cassette, containing the GAL 1Q 

promoter fused to the G&L4 structural gene, is 
constructed in such a way that cells which integrate 
this cassette at the HI£3 locus can be selected. The 
ELt- QQl% - £-i_ cerevisiae shuttle vector YEpSl contains 

25 the yeast GAL10 promoter sequences. This plasmid is 
digested with £ail3A and Sai l , and the 0.5 kbp CAL10 
promoter fragment is isolated by preparative agarose 
gel electrophoresis. Two oligonucleotide linkers are 
synthesized and ligated to the 0.5 tefcp; fragment, thus 

30 converting the 5* £au3A and the 3' Sail ends to Eco RI 
and fialll ends, respectively. After subsequent 
digestion with ECflRI and fialll, the 0.5 kbp 
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Eco RI-Bfllll GAL 10 promoter fragment is isolated by 
preparative agarose gel electrophoresis. The plasmid 
vector pJJ42 contains the yeast HIS3 gene. The yeast 
GAL 4 structural gene (including the translational 

5 start site) is isolated from pSJ3 and subcloned to 
form pIA. This plasmid is digested with fiaiDHI and 
HlMHI/ and the 2.9 kbp S&L4 fragment is isolated by 
preparative agarose gel electrophoresis. The plasmid 
vector pJJ42 contains the yeast HI£3 gene. This 

10 plasmid is digested with EfiflRI and Hindlll to remove 
a segment of the H1&3 gene extending from the middle 
of the coding region to just upstream of the HX£3 
promoter. The 12 kbp vector fragment is isolated by 
preparative agarose gel electrophoresis and ligated 

15 simultaneously with the 0.5 kbp and 2.9 kbp fragments 
described above to yield the plasmid pKHint-A. The 
p£AL10/£Mi4 expression cassette is removed from 
pKHint-A by digestion with pam HI, and the 5.6 kbp 
fragment is isolated by preparative agarose gel 

20 electrophoresis. The plasmid vector pJJ98, a 

modified pBR322 lacking the Hind lll site, is digested 
with Bam HI and ligated to the 5.6 kbp cassette. The 
resulting plasmid pKHint-B contains a unique Hin£III 
site located beyond the carboxy-terminal end of the 

25 GAL 4 coding sequence. The fully functional yeast 
URA 3 gene is isolated from the g.o 1 i - SL. 
cerevisiae shuttle vector YEp24 by digestion with 
Hjjl£III and separation of the 1.2 kbp fragment by 
preparative agarose gel electrophoresis. After 
30 digestion with fijjuaill, plasmid pKHint-B is ligated 
to the 1.2 kbp HindHI fragment carrying the LZR&3 
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gene to yield pKHint-C, which now contains the 
complete expression cassette for integration into the 
yeast chromosome. The salient features of this 
cassette are: 1) yeast regulatable SALIO promoter, 
5 2) yeast GAL 4 translational start site and structural 
gene, 3) yeast URA 3 promoter and structural gene for 
positive selection in ill&3~ yeast host, and 4) two 
BaiDHI sites which cleave within the 5 § and 3' non- 
translated sequences of the HI£3 gene for integrative- 
10 replacement transformation into the yeast Hl£3 locus. 

When digested with fiaiQHI, the plasmid 
pKHint-C yields a 6.8 kbp expression cassette flanked 
on either end by nontranslated HIS 3 DNA sequences. 
The digested DNA is used to transform S-u cerevisiae 
15 host strains Sc252 ( MAT a, ima3-52, igu2-2-112, ade l, 
MEL D and Sc294 ( MATc t, uxfi3-52, leu 2-2-112, 
Agill-10-7, HEL1) . Acting as a targeting device, 
the Bam HI ends direct the expression cassette to the 
chromosomal HIS 3 locus where the cassette integrates 
20 via homologous recombination. Transf ormants are 
selected for URA 3* and screened for his 3~ . 
Southern blot analysis of the clonal isolates, 
referred to as Sc252 (p GAL 10/ Ga! 4) and Sc294 
(pQMilO/fiAIi4 ),. confirms that the cassette has 
25 integrated at the HIS 3 locus. Transf ormants and 
their respective parental strains are grown in 
glycerol-lactic acid medium either with or without 2% 
(w/v) galactose. After harvesting at A 600 « 1.5, 
RNA extracts are prepared and analyzed by Northern 
30 blot techniques. In the presence of galactose, 

transcription of GAL 4 mRNA is observed to be at least 
20-fold enhanced in the Sc252 (pGAL10/ GAL 4 ) and Sc294 
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(p£AL10/SAL4) strains over their respective parental 
strains. The strains show basal-level expression of 
GAL 4 mRNA in the absence of galactose. Thus, the 
novel yeast strains display the desired phenotype of 
5 basal-level £AL4 expression in the absence of 

galactose and the higher (than wild-type strains) 
level expression of GAL 4 in the presence of galactose. 

In order to test whether the novel yeast 
strains afford an enhancement over wild-type strains 
10 of the levels of expression of a heterologous gene, 

an expression vector containing a galactose inducible 
promoter is prepared for transforming the strains. 
The salient features of this plasmid are: 
i)\ Escherichia ££li-derived sequences for the 
15 selection and amplification of the plasmid in 

EjX coli , 2) ce revi s i a e -de rived sequences for the 
selection and amplification of the plasmid in yeast, 
3) yeast fi&L10 promoter, 4) yeast alpha mating factor 
pre-leader for directing the translational product 
20 into the rough endoplasmic reticulum, 5) yeast alpha 

mating factor pro-leader which encodes N-glycosylation 
signal sequences and which is cleaved by the KE % 2 and 
gTE 13 proteases (gene products), 6) the coding 
sequence for the Epstein-Barr virus membrane antigen 
25 (EBMA) proteins, which are 350,000 and 220,000 dalton 
glycoproteins encoded by EB virus (EBV) and resident 
on the surface of EBV-infected cells and EB virions, 
7) a translational termination sequence, and 8) a 
yeast transcriptional termination sequence. This 
30 recombinant plasmid is introduced to £Lu gerevisiae 

strains which are wild-type for SAL4 and to isogenic 
S . perevisiae strains which are harboring the 
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integrant, and transformed clones are 
selected. Cells are grown in synthetic selective 
glycerol-lactic acid medium; galactose is added to 
the cultures to induce expression of the EBMA gene. 
5 RNA is prepared from both strains and subjected to 
Northern blot analysis. The mRNA encoding EBMA is 
amplified at least 4-5 fold in the pfiM*10/GAL4 
integrant over the wild-type strain. Lysates are 
prepared, electrophoreses in polyacrylamide gels, and 
10 Western blotted to nitrocellulose. EBMA proteins 
reactive specifically with human convalescent EBV 
sera are amplified approximately 10-fold in the 
p GAL 10/ GAL 4 integrant over the wild-type strain. No 
EBMA mRNA or EBMA protein is detectable in the 
15 p GAL 10/ GAL 4 integrant strain prior to the addition of 
galactose to the culture. 

The GAL 4 protein functions as a 
transcriptional activator of galactose-inducible 
promoters and does not a priori influence the way in 
20 which heterologous gene products whose expression is 
driven by galactose-inducible promoters would be 
translated or processed post-translationally . 
Therefore, it is obvious to those skilled in the art 
that the concept of utilizing a yeast strain in which 
25 the GAL 4 protein can be overproduced in a regulatable 
fashion as a host for heterologous gene expression 
extends to any heterologous gene whose expression is 
driven from a galactose-inducible promoter. Further- 
more, it is obvious to those skilled in the art that 
30 any one of the galactose-inducible promoters, GAL 1 * 
SAL2, C&L7, GAL10, or all being subject to 

GAL 4-mediated transcriptional activation, can be 
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utilized to drive the expression of heterologous 
genes in a yeast host in which the GAL 4 protein can 
be overproduced in a regulatable fashion. Another 
key concept is that the GAL4 protein overproduction 

5 should be regulatable, meaning that GAL 4 gene 

expression should be driven by a promoter whose 
activity is regulatable by any physiological means, 
including but not limited to the addition of 
galactose to a culture. Therefore, it is obvious to 

10 those skilled in the art that a novel strain in which 
the GAL 4 protein can be overproduced in a regulatable 
fashion would contain a GAL 4 gene whose expression is 
driven by a promoter other than one which is 
galactose-inducible, including but not limited to the 

15 promoters of the alcohol dehydrogenase 2, acid 

phosphatase, phosphog lycerate kinase, invertase, 
yeast heat shock and copperthionein genes. 
Furthermore, it is obvious to those skilled in the 
art that those sequences containing a regulatable 

20 promoter and the GAL 4 coding sequence can reside in 
the yeast host either in an integrated form or on a 
plasmid and still allow for the regulatable 
overproduction of the GAL 4 protein. 

The genus Saccharomvces is composed of a 

25 variety of species. Most commonly used is 

Saccharomvces cerevisiae , or baker's yeast, as a host 
for the recombinant-DNA mediated expression of a 
variety of foreign polypeptides. However, the 
distinctions among the other species of the genus 

30 Saccharomvces are not always well-defined. Many of 
these species are capable of interbreeding with 
cergvisiae and have been demonstrated to possess 
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galactose-inducible genes and positive regulatory- 
proteins for the transcriptional activation of such 
genes. The positive regulator, which in p f 
cerevisiae is known as GAL4, may be of different 
5 structure or name from GAL 4 and still function as a 
transcriptional activator of galactose- 
inducible promoters. Therefore, it will be obvious 
to those skilled in the art that the concept of 
utilizing a strain in which the fiMi4 protein, or 
10 other positive regulator for transcriptional 

activation of galactose-inducible promoters, can be 
overproduced in a regulatable fashion and can be 
extended to other species of the genus Saccharomvces 
including but not limited to car lsberaensis , 
15 norkensiS, diastaticus , ovif ormis . uvarum . rouxii . 
rPQnta n us, kluyveri , and elonaisporus . 

Several yeast genera, such as Hansenula . 
Candida, TprvlQPSiS/ and Pichia, have been shown to 
contain similar metabolic pathways for the utiliza- 
20 tion of methanol as a sole carbon source for growth. 
The gene for alcohol oxidase, an enzyme which 
participates in this metabolic pathway, has been 
isolated from Pichia pastoris . The pastoris 
alcohol oxidase promoter has been isolated and shown 
25 to be sensitive to methanol induction of expression. 
The promoter has been placed on an expression vector 
where it has been shown to promote the expression of 
foreign genes cloned adjacent to it. Thus, the 
principle of utilizing an inducible promoter to 
30 direct expression of foreign genes in species of 

other yeast genera is well-established. Moreover, 
species of other yeast genera can utilize a variety 
of carbon sources, including galactose, for growth. 
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Therefore, it will be obvious to those skilled in the 
art that the concept of increased and regulated 
expression of a positive regulatory element as a 
transcriptional activator of galactose inducible 

5 promoters to increase expression of a heterologous 
gene whose expression is driven by a galactose- 
inducible promoter extends the range of suitable 
yeast hosts to species of yeast from the families 
Saccharomvcetaceae and Cr yptococcaceae , including but 

10 not limited to species from the genera Pichia, 

Candida . Hansenula , Torulopsis, KluyveromYCes / and 
Saccharo rnvcopsis . 

The following examples illustrate the 
present invention without, however, limiting the same 

15 thereto. The disclosure of each reference mentioned 
in the following examples is hereby incorporated by 
reference . 



EXAMPLE 1 

20 Preparation of a p G AL 10/ G AL 4 expression 

cassette for integrative transformation 

The ILu coli - cerevisiae shuttle vector 
YEpSl (Broach £t £l. , Experime ntal Manipulation of 
Gene Expression , p. 83, Academic Press, 1983) 

25 contains the yeast GAL 10 promoter sequences- This 

plasmid was digested with Sau 3A and Sai l, and the 0.5 
kbp GAL 10 promoter fragment was isolated by prepara- 
tive agarose gel electrophoresis (Figure 1, top). 
Two oligonucleotide linkers were synthesized and 

30 ligated to the 0.5 kbp fragment, thus converting the 
5' Sau 3A and the 3* Sai l ends to Fc qRI and Pqjjll 
ends, respectively. The respective structures of 
these oligonucleotides are 




• 
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CTGAATTC 



GACTTAAGCTAG 



and 



TCGAGATCTTAG 



CTAGAATC 



After subsequent digestion with E££RI and ficQ.II, the 
10 0.5 kbp Eco RI- BqI II GAL 10 promoter fragment was 

isolated by preparative agarose gel electrophoresis 
(Figure 1, top) . 

In order to isolate the GAL 4 structural gene 
free of its own transcriptional control sequences, 

15 pIA (Figure 1, middle) was constructed according to 
the following procedure. pYe(CEN3)41 [Clark and 
Carbon, Nature 287 : 504 (1980)] was digested with 
PstI and the large fragment self-ligated to remove 
the LEU 2 gene and reconstitute the bla gene. The 

20 resulting plasmid pYC was digested with fi^mHI and 
Sai l and the large Bam HI- Sal l fragment was ligated 
with a 2.4 kbp Bol II- Sal l fragment bearing the yeast 
LEU 2 gene, which had been isolated by digestion of 
YEpl3 [Broach ££ al. . Gene 8: 121 (1979)] with Bql ll 

25 and Sai l . As a result of this ligation, the Bam HI 
site is destroyed in the resulting plasmid pYCL-1. 
Plasmid pAAR6 (Ammerer, Methods in Enzymology 101 : 
192 (1983)] was digested with £amHI and HinflHI # , and 
the 1.5 kbp Bam HI- Hind lll fragment containing the 

30 yeast alcohol dehydrogenase I (ADHI) gene promoter 
was isolated. The plasmid pYCL-1 was then digested 
with BqI II and Hin£III and the resulting large vector 
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fragment was ligated with the 1.5 kbp Bj^I-Hindlll 
ADHI promoter fragment to yield pYCLF , which does not 
contain a BaffiHI site. The G&k* gene then was 
isolated from pSJ3 [Johnston and Hopper, Proc. Natl. 
5 Acad. Sci., U.S.A. Z&: 6971 (1982)] by digestion 

with Hind lll. The resulting 3.3 kbp Bind.HI fragment 
containing the SAL4 gene then was inserted into the 
unique Hind.HI site of pYCLF in the appropriate 
orientation, such that the AJ2HI promoter was upstream 
10 of the GAL 4 structural gene. The resulting plasmid 
YCpLF+4 contains a unique B_amHI site which is 
equidistant (approximately 0.45 kbp) from the 3' -end 
of the ADH I promoter and the translational start of 
the GAL 4 gene. YCpLF+4 was cut at the EamHI site and 
15 then treated with exonuclease Bal-31 to generate 

various length deletions. The resulting ends were 
made flush-ended with the Klenow fragment of DNA 
polymerase I. Bam HI linkers then were added and the 
ends religated, producing a family of plasmids with 
20 various fusions of the ADH I promoter to the S&L* 

gene. Subsequent characterization of the resulting 
plasmids led to the isolation of a particular plasmid 
pIA in which the BamHI site at the 3' -end of the ADHI 
promoter was located five base pairs upstream of the 
25 ATG translational initiation codon for the GAI.4 
structural gene. The yeast fiAI.4 structural gene 
(including the translational start site) was isolated 
from the plasmid pIA. This plasmid was digested with 
BamHI and Hind lll and a 2.9 kbp fiM.4 fragment was 
30 isolated by preparative agarose gel electrophoresis 

(Figure 1, middle). The plasmid vector pJJ42 [Struhl 
et a 1. , Cold Spring Harbor Symp. Quant. Biol., 42: 
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901 (1982)] contains the yeast HI£3 gene. This 
plasmid was digested with EcoR I and Hind lll which 
cleave within the fil£3 promoter and within the HIS 3 
gene, respectively, A 12 kbp vector fragment was 

5 isolated by preparative agarose gel electrophoresis 

and ligated simultaneously to the 0.5 kbp and 2.9 kbp 
fragments described above to yield the plasmid 
pKHint-A (Figure 1, bottom). The P GAL 10/ GAL 4 
expression cassette was removed from pKHint-A by 

10 digestion with BamHI and isolation of the 5.6 kbp 

fragment by preparative agarose gel electrophoresis 
(Figure 2 # top). The plasmid vector pJJ98, a 
modified pBR322 lacking the HinflHI site, was 
digested with Bam HI and ligated to the 5.6 kbp 

15 fragment (Figure 2, middle). The resulting plasmid 
pKHint-B contains a unique Hind lll site located at 
the carboxy-terminal end of the GAL 4 coding 
sequence. The fully functional yeast URA 3 gene was 
isolated from the coli - fij_ cerevisiae shuttle 

20 vector YEp24 [Botstein g£ &1. , Cell 8: 17 (1979)] by 
digestion with Hind lll and separation of the 1.2 kbp 
fragment by preparative agarose gel electrophoresis 
(Figure 2, middle). After digestion with Hind lll , 
plasmid pKHint-B was ligated to the 1.2 kbp Hinsilll 

25 fragment carrying the URA 3 gene to yield pKHint-C, 
which now contains the complete expression cassette 
for integration into the yeast chromosome. The 
salient features of this cassette are: 1) yeast 
regulatable GA^IO promoter, 2) yeast Q£J^4 
30 translational start site and structural gene, 3) 

yeast URA 3 promoter and structural gene for positive 
selection in ura 3~ yeast host, 4) two Bam HI sites 
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which cleave within the 5' and 3 1 nontranslated 
sequence of the HIS 3 gene for integrative 
transformation into the yeast HIS 3 chromosomal locus. 



5 EXAMPLE 2 

Preparation of novel yeast strains containing 
an integrated PGAL10/GAL4 expression cassette 
When digested with Bam HI , the plasmid 
pKHint-C yielded a 6-8 kbp expression cassette 

10 flanked on either end by nontranslated HIS 3 DNA 

sequences (Figure 3, top). The digested DNA was used 
to transform £j_ cerevisiae host strains Sc252 ( MAT a . 
ura 3-52, leu 2-2-112 , ade l , MEL 1) [Johnston fit &1 . / 
Proc. Natl. Acad. Sci . U.S.A., 7jJ: 697 (1982)] and 

15 Sc294 (Mata, ura 3-52 , lfiii2-2-112, A cral l-10-7, 

MEL D . Acting as a targeting device, the Bam HI ends 
directed the expression cassette to the chromosomal 
HIS 3 locus where the cassette integrated via 
homologous recombination (Figure 3, middle). 

20 Transf ormants were selected for URA 3* and screened 
for his 3"~ . Southern blot analysis of the clonal 
isolates confirmed that the cassette had integrated 
at the HIS 3 locus. Transf ormants and their 
respective parental strains were grown in synthetic 

25 selective (leu~) glycerol-lactic acid medium 

[Sherman fit Methods in Yeast Genetics . & 

laboratory Manual . Cold Spring Harbor Press (1979)], 
either with or without 2% (w/v) galactose. After 
harvesting at A 60 ^ = 1.5, RNA extracts were 

30 prepared [Zitomer fit Proc. Natl. Acad. Sci. 

U.S.A. 76: 627 (1979)] and analyzed by Northern blot 
techniques. [Thomas, Proc. Natl. Acad. Sci. U.S.A. 
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77 : 5201 (1980)]. In the presence of galactose, 
transcription of GAL 4 mRNA is observed to be at least 
20-fold enhanced in the Sc252 (p£&L10/SAL4) and Sc294 
(PGAL10/ GAL 4) strains over their respective parental 

5 strains. The strains show basal-level expression of 
GAL 4 mRNA in the absence of galactose. Thus, the 
novel yeast strains display the desired phenotype of 
basal-level GAL4 expression in the absence of 
galactose and much higher (than wild-type strains) 

10 level expression of GAL 4 in the presence of galactose. 

EXAMPLE 3 

Preparation of an expression vector with a GAL 10 
promoter directing expression of EBMA 
15 The alpha mating factor (a factor) gene 

MFal was cloned onto a plasmid vector from S . 
cerevisiae genomic DNA [Kurjan .al. , Cell 3H: 933 

(1982) ; Singh £i al. , Nucleic Acids Res. H: 4049 

(1983) ]. The resulting plasmid pKH2 was digested 

20 with E££RI and the 1.7 kbp fragment bearing the alpha 
mating factor gene was purified by preparative 
agarose gel electrophoresis (Figure 4, top). Plasmid 
pRJ148 (a modified pBR322 lacking the Hindi I I site) 
was digested with Eco RI and ligated to the 1.7 kbp 

25 fragment to yield the plasmid pRJ159 (Figure 4, 
middle). This DNA was digested with Hindlll and 
self-ligated to form plasmid pRJ167, which now has a 
unique HiMIII site (Figure 4, middle). Plasmid 
pRJ167 was digested with HinslIII and modified by the 

30 insertion of a synthetic oligonucleotide adaptor to 
yield a new plasmid (pRJ178) containing a unique 
Hind lll site which is to the 3' side of the promoter 
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and pre-pro-leader and to the 5' side of the 
translational termination signals in all three 
reading frames (Figure 4, bottom). The Hinfllll site 
was converted to a fiamHI site by digestion with 
5 Hind lll , flush-ending with the Klenow fragment of DNA 
polymerase I, addition of fLaillHI linkers and 
self-ligation to form plasmid pJC193 (Figure 5, 
top). This plasmid was digested with EcfiRI, 
flush-ended with the Klenow fragment of DNA 
10 polymerase 1, modified by the addition of 

linkers, digested with ficll, and the 1.5 kbp fragment 
bearing the alpha mating factor gene isolated by 
preparative agarose gel electrophoresis (Figure 5, 
middle) . This resulting pel l fragment then was 
15 treated with calf intestine alkaline phosphatase and 
inserted into the unique Bam HI site of pCl/1, 
destroying the original Bam HI site in the process 
(plasmid pJC194; Figure 5, bottom). This DNA was 
digested with 3am HI and self-ligated to remove excess 
20 Bam HI linkers (plasmid pJC197, Figure 5, bottom). 

pJC197 was digested with Eco RI and Pst I . A 
0.7 kbp fragment containing a portion of the alpha 
mating factor pre-pro-leader , a three-frame transla- 
tional terminator and the alpha mating factor 
25 transcriptional terminator was isolated by prepara- 
tive agarose gel electrophoresis (Figure 6, middle 
left). The 1Li_ coli - S^. ^erevisiae shuttle vector 
YEpSl contains the yeast GAL 10 gene promoter 
sequences. This DNA was digested with &aii3A, made 
30 flush-ended with the Klenow fragment of DNA 

polymerase I, and ligated with synthetic fiamHI 
linkers of the structure: 
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CGGATCCG 
GCCTAGGC 

The linkered DNA was digested with Sai l and a 0.5 kbp 

5 GAL 10 promoter fragment was isolated by preparative 

agarose gel electrophoresis (Figure 6, top left). A 

35 bp £sJLI - Esil synthetic oligonucleotide adapter 

was synthesized, having the structure: 

1 0 TCGACCAAAAGAATGAGATTTCCTTCAATTTTTACTGCA 

GGTTTTCTTACTCTAAAGGAAGTTAAAAATG 

This adapter contains 11 base pairs of the alpha 
mating factor non-translated leader and the ATG and 
15 first 8 amino acids of the alpha mating factor 

pre-pro-leader . The 0.5 kbp GAL 10 promoter and the 
35 bp adapter fragments were ligated together and 
digested with Bam HI to generate a 5* Bam HI end. The 
resulting 0.5 kbp Bam HI- Pst I fiALlO promoter-alpha 
20 mating factor fragment was isolated by preparative 

agarose gel electrophoresis (Figure 6, middle). The 
plasmid vector pBR322 was digested with Eco RI and 
Bam HI , and a 4.0 kbp vector fragment was isolated by 
preparative agarose gel electrophoresis* The 4.0 
25 kbp, 0.7 kbp and 0.5 kbp fragments were ligated 

together to create the plasmid pKH207-l (Figure 6, 
bottom left). pKH207-l was digested to completion 
with Eco RI and partially digested with fiflUiHl . The 
1.2 kbp Bam HI - Eco RI GAL 10 promoter-alpha mating 
30 factor expression cassette was isolated by 

preparative agarose gel electrophoresis and made 
flush-ended with the Klenow fragment of DNA 
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polymerase I (Figure 7, middle). The IL*_ CQli - fi^. 
r.prpvisiae shuttle vector pCl/1 [Beggs, Nature 225.' 
104 (1978); Rosenberg e_t fil.. Nature 3_L2_: 77 (1984)] 
was digested with B_ajnHI and £a_LI, made flush-ended 
5 with the Klenow fragment of DNA polymerase I, and 
ligated with the 1.2 kbp cassette to create a new 
expression plasmid, pKH4 (Figure 7, bottom right). 

The B68' plasmid [Hummel st fil. , J. Virol. 
49 ; 4i3 (1984)] carrying the Bam HI-L DNA fragment 
10 containing the EBMA gene was digested with Xiio.II and 
Sea l , and the 2.5 kbp fragment containing the EBMA 
structural gene was made flush-ended with the Klenow 
fragment of DNA polymerase I (Figure 8, top). The 
expression vector pKH4 was digested with fiamHI and 
15 made flush-ended with the Klenow fragment of DNA 

polymerase I. The 2.5 kbp EBMA fragment was ligated 
to the vector. This recombinant plasmid pKH4-EBMA 
(Figure 8, bottom) has the following salient 
features: (1) coli -derived sequences for the 
20 selection and amplification of the plasmid in 

E.X coli . (2) rerevisiae -derived sequences for the 
selection and amplification of the plasmid in yeast, 
(3) yeast GAL 10 promoter, (4) yeast alpha mating 
factor pre-leader for directing the translational 
25 product into the rough endoplasmic reticulum, (5) 
yeast alpha mating factor pro-leader which encodes 
N-glycosylation signal sequences and which is cleaved 
by the KEX 2 and £££13 proteases (gene products) 
[Julius e_t &!•/ Cell 22.'. 839 (1983); Julius e_t al* , 
30 Cell 3_7: 1075 (1984)], (6) the coding sequence for 
EBMA, (7) a translational termination sequence, and 
(8) a transcriptional termination sequence. 
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EXAMPLE 4 

Expression of the EBMA gene in novel 
yeast strains containing an integrated 

5 The expression vector pKH4-EBMA (Figure 8, 

bottom) was introduced to JL. cerevisiae strains 
Sc252, SC252 (pfiAL10/fiAL4 ) and Sc294 (pfiAL10/fi&L4 ) . 
The transf ormants (with pKH4-EBMA) were grown in 
synthetic selective glycerol-lactic acid medium to 
10 A 600 - 0.5-1.0, at which time galactose was added 

to a final concentration of 2% (w/v) . Lysates were 
prepared, electrophoreses in polyacrylamide gels and 
Western blotted to nitrocellulose. The 375,000 and 
225,000 and 175,000 dalton proteins were found to be 
15 specific to EBMA by virtue of their presence only in 
transf ormants , their reactivity with human 
convalescent EBV sera and an EBMA-specif ic monoclonal 
antibody [Qualtiere £t il., Proc. Natl. Acad. Sci. 
79 : 616 (1982)], and their lack of reactivity with 
20 pooled negative human sera. No EBMA-specif ic 

proteins were detected in cultures grown in the 
absence of galactose. At 24 hours after the addition 
of galactose, the 2 strains Sc252 (pGM*10/fiAL4) and 
Sc294 (pGAL10/fiAL4) each contain 10-fold more 
25 EBMA-specif ic proteins than does strain Sc252. 



30 
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WHAT IS CLAIMED IS : 

1. A strain of species of the families 
Saccharomycetaceae or Crvptococcaceae containing 

5 sequences comprised of a physiologically regulatable 
promoter linked to the gene for a positive regulatory 
element which serves as the transcriptional activator 
of galactose-inducible promoters/ said strain being 
the host for a heterologous gene whose expression is 

10 driven by a galactose-inducible promoter, 

2. A strain according to Claim 1, wherein 
the host is a species of the genus Saccharomvces . 

15 3. A strain according to Claim 2, wherein 

the host is Saccharomvces cerevisiae . 

4. A strain according to Claim 1, wherein 
the promoter linked to the positive-regulatory 

20 element is galactose-inducible. 

5. A strain according to Claim 4, wherein 
the promoter is <2M±10 . 

25 6. A strain according to Claim 1, wherein 

the gene for the positive regulatory element is GAL 4 , 



7. A strain according to Claim 1, wherein 
the galactose-inducible promoter is GAL 1 . GAL 7, 
30 GAL 10 m or MEL1. 
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